Policy gradient learning for a humanoid soccer robot

نویسندگان

Andrea Cherubini

Francesca Giannone

Luca Iocchi

M. Lombardo

Giuseppe Oriolo

چکیده

In humanoid robotic soccer, many factors, both at low-level (e.g., vision and motion control) and at high-level (e.g., behaviors and game strategies), determine the quality of the robot performance. In particular, the speed of individual robots, the precision of the trajectory, and the stability of the walking gaits, have a high impact on the success of a team. Consequently, humanoid soccer robots require fine tuning, especially for the basic behaviors. In recent years, machine learning techniques have been used to find optimal parameter sets for various humanoid robot behaviors. However, a drawback of learning techniques is time consumption: a practical learning method for robotic applications must be effective with a small amount of data. In this article, we compare two learning methods for humanoid walking gaits based on the Policy Gradient algorithm. We demonstrate that an extension of the classic Policy Gradient algorithm that takes into account parameter relevance allows for better solutions when only a few experiments are available. The results of our experimental work show the effectiveness of the policy gradient learning method, as well as its higher convergence rate, when the relevance of parameters is taken into account during learning.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning for Humanoid Robotics

Reinforcement learning offers one of the most general framework to take traditional robotics towards true autonomy and versatility. However, applying reinforcement learning to high dimensional movement systems like humanoid robots remains an unsolved problem. In this paper, we discuss different approaches of reinforcement learning in terms of their applicability in humanoid robotics. Methods ca...

متن کامل

Study of Evolutionary and Swarm Intelligent Techniques for Soccer Robot Path Planning

Finding an optimal path for a robot in a soccer field involves different parameters such as the positions of the robot, positions of the obstacles, etc. Due to simplicity and smoothness of Ferguson Spline, it has been employed for path planning between arbitrary points on the field in many research teams. In order to optimize the parameters of Ferguson Spline some evolutionary or intelligent al...

متن کامل

Policy Gradient Methods for Robot Control

Reinforcement learning offers the most general framework to take traditional robotics towards true autonomy and versatility. However, applying reinforcement learning to high dimensional movement systems like humanoid robots remains an unsolved problem. In this paper, we discuss different approaches of reinforcement learning in terms of their applicability in humanoid robotics. Methods can be co...

متن کامل

Robo-Erectus: a low-cost autonomous humanoid soccer robot

The humanoid soccer robot league is a new international initiative to foster robotics and AI technologies using soccer games [1]. This paper provides a brief description of a low-cost autonomous humanoid soccer robot called Robo-Erectus (RE), which has been developed in the Center for Advanced Robotics and Intelligent Control (ARICC) at Singapore Polytechnic since 2001. To develop a low-cost hu...

متن کامل

Episodic Reinforcement Learning Control Approach for Biped Walking

This paper presents a hybrid dynamic control approach to the realisation of humanoid biped robotic walk, focusing on the policy gradient episodic reinforcement learning with fuzzy evaluative feedback. The proposed structure of controller involves two feedback loops: a conventional computed torque controller and an episodic reinforcement learning controller. The reinforcement learning part inclu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Robotics and Autonomous Systems

دوره 57 شماره

صفحات -

تاریخ انتشار 2009

Policy gradient learning for a humanoid soccer robot

نویسندگان

چکیده

منابع مشابه

Reinforcement Learning for Humanoid Robotics

Study of Evolutionary and Swarm Intelligent Techniques for Soccer Robot Path Planning

Policy Gradient Methods for Robot Control

Robo-Erectus: a low-cost autonomous humanoid soccer robot

Episodic Reinforcement Learning Control Approach for Biped Walking

عنوان ژورنال:

اشتراک گذاری